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ABSTRACT 

The Baryon Oscillation Spectroscopic Survey (BOSS) has collected spectra for over 
one million galaxies at 0.15 < z < 0.7 over a volume of 15.3 Gpc^ (9,376 deg^) - 
providing us an opportunity to study the most massive galaxy populations with van¬ 
ishing sample variance. However, BOSS samples are selected via complex color cuts 
that are optimized for cosmology studies, not galaxy science. In this paper, we sup¬ 
plement BOSS samples with photometric redshifts from the Stripe 82 Massive Galaxy 
Gatalog and measure the total galaxy stellar mass function (SMF) at z ^ 0.3 and 
z ^ 0.55. With the total SMF in hand, we characterize the stellar mass completeness 
of BOSS samples. The high-redshift CMASS (“constant mass”) sample is significantly 
impacted by mass incompleteness and is 80% complete at log 2 o(A 7 ,/MQ) > 11.6 only 
in the narrow redshift range z = [0.51,0.61]. The low redshift LOWZ sample is 80% 
complete at logiQ(M,/M 0 ) > 11.6 for z = [0.15,0.43]. To construct mass complete 
samples at lower masses, spectroscopic samples need to be significantly supplemented 
by photometric redshifts. This work will enable future studies to better utilize the 
BOSS samples for galaxy-formation science. 

Key words: Galaxies: abundances - evolution - stellar content. Cosmology: obser¬ 
vations. 


1 INTRODUCTION 

There is tremendous interest in constraining the size, stel¬ 
lar mass, and halo mass evolution of the most massive 
galaxies in the universe (Tojeiro et al. 2012; Tal et al. 2013; 
Maraston et al. 2013; Beifiori et al. 2014; Marchesini et al. 
2014; van de Sande et al. 2015; Marsan et al. 2015, to cite 
a few recent examples). The evolution of these proper¬ 
ties places strong constraints on models of galaxy forma¬ 
tion which traditionally have difficulty reproducing observed 
trends such as the amplitude of the stellar mass function at 
the highest masses {e.g., Benson et al. 2003; Maraston et al. 
2013; Knebe et al. 2015), although much progress has been 
made in the past few years {e.g., Furlong et al. 2014; Benson 
2014). 


From an observational standpoint, spectroscopic sam¬ 
ples of massive galaxies present key advantages over pho¬ 
tometric samples. For example, errors in stellar mass esti¬ 
mates are reduced for spectroscopic samples compared to 
photometric redshift samples. Typical 5-band photometric 
redshifts at z ~ 0.5 have an error of = 0.03 to = 0.05 
even for the most massive galaxies. This error translates 
into a stellar mass uncertainty of ^0.1 dex which may domi¬ 
nate the total stellar mass error budget. Spectroscopic sam¬ 
ples of massive galaxies are also key in order to perform 
accurate measurements of clustering and/or galaxy-galaxy 
leasing which place tight constraints on the galaxy-halo 
connection {e.g., Mandelbaum et al. 2006; Leauthaud et al. 
2012; Coupon et al. 2015; Zu & Mandelbaum 2015). Finally, 
the spectra themselves contain key information and can 
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be used to constrain stellar ages, star formation histories 
(SFHs), dust extinctions and stellar velocity dispersions 
{e.g., Chen et al. 2012; Thomas et al. 2013). 

For these reasons, from a galaxy formation perspec¬ 
tive, large samples of massive galaxies with spectroscopic 
redshifts over a wide redshift range are highly desir¬ 
able. Surveys such as zCOSMOS (Lilly et al. 2007), VVDS 
(Le Fevre et al. 2004, 2015), DEEP2 (Newman et al. 2013), 
PRIMUS (Coil et al. 2011), and VIPERS (Guzzo et al. 
2014) provide spectroscopic samples that probe the 0.2 < 
2 < 1.0 universe and complement the Sloan Digital Sky 
Survey (SDSS, York et al. 2000) main sample &t z — 0.1 
(Strauss et al. 2002). These higher redshift surveys, how¬ 
ever, still cover relatively small areas ranging from a few 
square degrees to a few tens of square degrees (e.j.,VIPERS 
covers 24 deg^). The volumes probed by these surveys are 
insufficient to provide statistically significant samples of the 
most massive galaxies {log{M,/M q) > 11.5) which have low 
number densities (n ~ 2 x 10~® Mpc“^). For example, the 
highest mass bin at 0.48 < a < 0.74 in the Leauthaud et al. 
(2012) COSMOS analysis of the stellar-to-halo mass relation 
only contains 300 galaxies at log(M*/M 0 ) > 11.29, only 
71 of which have masses greater than log(M*/MQ) > 11.5. 
The highest mass bin in the Coupon et al. (2015) analysis, 
which covers a wider area (23.1 deg^), contains 6326 galax¬ 
ies at 0.5 < z < 1.0 and log{Mt/M q) > 11.2. However, 
this bin only contains 498 galaxies at log(M*/M©) > 11.5, 
of which only 234 have a secure spectroscopic redshift from 
the VIPERS, VVDS, or PRIMUS surveys. Because of these 
small samples sizes, studies of the stellar-to-halo mass re¬ 
lation, for example, remain poorly constrained at the very 
high-mass end. 

An exciting opportunity is the Baryon Oscilla¬ 
tion Spectroscopic Survey (BOSS, Eisenstein et al. 2011; 
Dawson et al. 2013), which, with the final DR12 data release 
(Alam et al. 2015), has collected spectra for more than one 
million galaxies massive galaxies (log(M*/M 0 ) > 11 . 0 ) at 
0.15 < z < 0.7 over a volume of 15.3 Gpc® (9,376 deg^), 
providing the potential to study the most massive galaxy 
populations with vanishing sample variance. However, the 
sample selections of BAO surveys involve complex color cuts 
that are optimized for cosmology studies, not galaxy science. 
While a number of BOSS galaxy studies have been published 
(Tojeiro et al. 2012; Maraston et al. 2013; Guo et al. 2013; 
Beifiori et al. 2014; Guo et al. 2014; Montero-Dorta et al. 
2014, 2015; Reid et al. 2014; Guo et al. 2015, to cite a few), 
the stellar mass completeness of the BOSS samples remains 
poorly understood. 

In this paper, the second in a series, we use a new 
compilation of wide-held survey data, the Stripe 82 Massive 
Galaxy Gatalog (s82-mgc), to address this problem. Paper 
I (Bundy et al., in preparation) describes the construction 
of S82-MGC, which matches the 2 magnitudes deeper “SDSS 
Goadd” optical photometry (Annis et al. 2011) in the equa¬ 
torial Stripe 82 with the Large Area Survey (LAS) near- 
IR photometry from the UKIRT Infrared Deep Sky Sur¬ 
vey (UKIDSS, Lawrence et al. 2007). Supplementing with 
a variety of photometric redshifts, S82-MGC enables near- 
IR based stellar mass estimates for complete samples with 
log(M*/M 0 ) > 11.2 and z < 0.7. 

In this paper, we use the S82-MGC to investigate the 
stellar mass completeness of the two main BOSS spectro¬ 


scopic samples, the LOWZ sample at 0.15 < « < 0.43 and 
the CMASS (“Constant Mass”) sample at 0.43 < « < 0.7. 
Our characterization of the completeness is made with re¬ 
spect to a measurement of the galaxy stellar mass function 
using the S82-MGC. The evolution of the mass function, a de¬ 
tailed study of potential biases, and implications for galaxy 
growth are discussed in Paper HI (Bundy et al., in prepara¬ 
tion). Here, we provide convenient fitting formulae which can 
be used to estimate the completeness of each of the BOSS 
samples as a function of stellar mass and redshift (Sections 

6.1 and 6 . 2 ). 

This work is complementary to the analysis performed 
by Montero-Dorta et al. (2014) who studied the magnitude 
and color completeness of the high-redshift BOSS CMASS 
sample. This paper focuses on stellar mass completeness for 
the full CMASS and LOWZ samples, an aspect that is not 
addressed in Montero-Dorta et al. (2014). 

Upcoming surveys such as the Hyper Suprime Cam sur¬ 
vey^ (HSC) and the Euclid survey (Laureijs et al. 2011) will 
be able to use photometric redshifts to supplement spec¬ 
troscopic samples and to construct mass limited samples 
over wider redshift and mass ranges than using spectroscopic 
samples alone. We characterize the level to which spectro¬ 
scopic samples need to be supplemented by photometric red¬ 
shifts as a function of mass and redshift (see Section 3). 

Finally, a better understanding of the BOSS selection 
functions will also enable the construction of improved mock 
catalogs that are critical to investigating the link between 
galaxies and their host halos. Our companion paper (Saito et 
al., in preparation) presents improved mock catalogs that ac¬ 
count for the stellar mass completeness of the BOSS CMASS 
sample as a function of redshift. 

The layout of this paper is as follows. Section 2 presents 
the data used in this paper. Section 3 discusses the charac¬ 
teristics of of sample. Section 4 broadly describes the effects 
of the BOSS sample selection. Section 5 presents our es¬ 
timate of the total stellar mass function as a function of 
redshift. Section 6 describes our completeness estimates for 
CMASS and LOWZ. Section 7 presents mass completeness 
estimates for several previous studies that used BOSS data. 
Finally, our summary and conclusions are presented in Sec¬ 
tion 8 . We assume a ACDM cosmology with Dm = 0.274, 
Ho = 70 km s“^ Mpc“^. Stellar mass is noted M* and has 
been derived using a Chabrier Initial Mass Function (IMF, 
Chabrier 2003). 

2 DATA 

2.1 BOSS Spectroscopic Samples 

BOSS is a spectroscopic survey of 1.5 million galaxies over 
10,000 deg^ that was conducted as part of the SDSS-HI 
program (Eisenstein et al. 2011) on the 2.5 m aperture 
Sloan Foundation Telescope at Apache Point Observatory 
(Gunn et al. 1998, 2006). A general overview of the BOSS 
survey can be found in Dawson et al. (2013), the BOSS spec¬ 
trographs are described in Smee et al. (2013), and the BOSS 
pipeline is described in Bolton et al. (2012). BOSS galaxies 
were selected from Data Release 8 (DR 8 , Aihara et al. 2011) 

^ http://www.naoj.org/Projects/HSC/HSCProject.html 
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ugriz imaging (Fukugita et al. 1996) using a series of color- 
magnitude cuts motivated by the Maraston et al. (2009) 
LRG model. This is a passive template mainly dominated 
by a metal-rich population but also includes a small metal- 
poor population (3% by mass) to mimic observed metallic- 
ity gradients of local massive elliptical galaxies. This tem¬ 
plate was shown to fit the observed-frame Sloan colors of 2- 
SLAQ galaxies at redshift 0.4-0.6 better than models which 
include star formation (Wake et al. 2006). This conclusion 
was also independently confirmed by Montero-Dorta et al. 
(2015). The choice of a passive template was motivated 
by the intention of selecting the most massive and passive 
galaxies for BAO studies. 

The BOSS selection uses the following set of color cri¬ 
teria: 

C|| = 0.7(ymod rmod) T 1.2[(rmod fmod) 0.18](1) 

C_L — (rmod imod) (^?mod r’niod)/4 0.18 (2) 

d_L = (rmod fmod) (Pmod r’mod)/8.0 (3) 

The subscript “mod” denotes model magnitudes, which 
are derived by adopting the better fitting luminosity pro- 
Hle between a de Vaucouleurs and an exponential luminos¬ 
ity proHle in the r-band (Stoughton et al. 2002). The sub¬ 
script “cmod” denotes composite model magnitudes, which 
are calculated from the best-fitting linear combination of 
a de Vaucouleurs and an exponential luminosity prohle 
(Abazajian et al. 2004). PSF magnitudes are denoted with 
the subscript “psf”. BOSS color cuts are computed using 
model magnitudes, whereas magnitude cuts are computed 
using cmodel magnitudes. All magnitudes are corrected for 
Galactic extinction using the dust maps of Schlegel et al. 
(1998). 

BOSS targeted two primary galaxy samples: the LOWZ 
sample at 0.15 < 2 < 0.43 and the GMASS sample at 0.43 < 
z < 0.7. The LOWZ sample is an extension of the SDSS 
I/II Luminous Red Galaxy (LRG) sample (Eisenstein et al. 
2001) to fainter magnitudes and is defined according to the 
following selection criteria: 


|cu| 

< 

0.2 

(4) 

^cmod 

< 

13.6-bC||/0.3 

(5) 

16 <C 'Tcmod 

< 

19.6 

(6) 

'^psf '^cmod 

> 

0.3 

(7) 


Equation 4 sets the color boundaries of the sample; 
equation 5 is a sliding magnitude cut which selects the 
brightest galaxies at each redshift; equation 6 corresponds 
to the bright and faint limits and equation 7 is to separate 
galaxies from stars. In a similar fashion to the SDSS I/II 
Luminous Red Galaxy (LRG) sample, the LOWZ selection 
primarily selects red galaxies. Over most of the BOSS foot¬ 
print, roughly one third of the LOWZ sample has a spec¬ 
trum from SDSS-II (these objects were not re-observed by 
BOSS). Galaxies with a spectrum that pre-dates the BOSS 
survey will be referred to interchangeably either as “Legacy 
objects” or “SDSS KNOWN”. 

The CMASS sample targets galaxies at higher redshifts 
with a surface density of roughly 120 deg“^. CMASS tar¬ 
gets are selected from SDSS DR8 imaging according to the 
following cuts: 


|dx| 

> 

0.55 

(8) 

^cmod 

< 

19.86-b 1.6(du -0.8) 

(9) 

17.5 

< 

^cmod 'b 19.9 

(10) 

^mod ^mod 

< 

2 

(11) 

ifih2 

< 

21.5 

(12) 


where ifibz is the estimated i-band magnitude in a 2" aper¬ 
ture diameter assuming 2” seeing. Star-galaxy separation on 
CMASS targets is performed via: 

ipsf-imod > 0.2-b 0.2(20.0 - imod) (13) 

Zpsf ■^mod ^ 9.125 - 0.462niod. (14) 

In addition to CMASS, BOSS also targeted a smaller 
ancillary sample known as “CMASS SPARSE” that was de¬ 
signed to test the impact of the color-magnitude cuts. The 
SPARSE sample includes fainter and bluer galaxies by ex¬ 
tending the sliding cut to: 

icmod < 20.14 -b 1.6(dx - 0.8). (15) 

The SPARSE region (contained between the fiducial 
CMASS sliding cut and the SPARSE sliding cut) is ran¬ 
domly down-sampled to yield about 5 objects per square 
degree. Beyond highlighting the SPARSE region in certain 
hgures, we do not utilize this sample in this paper. Instead, 
our analysis relies on photometric redshifts to probe the full 
massive galaxy population that extends beyond the BOSS 
color boundaries (see Section 2.2). 

The CMASS sample was originally designed to loosely 
follow a constant stellar mass cut at 0.4 < 2 < 0.6 (see 
Figure 1 in Maraston et al. 2013) and to allow for a wider 
range of galaxy colors than either the SDSS-II LRG or the 
LOWZ samples. Using high-resolution HST/AGS imaging. 
Masters et al. (2011) show that roughly 26% of the GMASS 
galaxies in the GOSMOS survey are morphologically classi¬ 
fied as late-types (with an observed color of g — i < 2.35). 
Using a maximum likelihood approach that accounts for 
photometric errors as well as the CMASS selection cuts, 
Montero-Dorta et al. (2014) estimate that 37% of CMASS 
object may intrinsically belong to the blue cloud. 

The CMASS sample is thought to be more complete at 
higher stellar masses than the SDSS-II LRG and LOWZ 
samples which are color-selected (Anderson et al. 2014). 
However, we will demonstrate in Section 6.2 that the op¬ 
posite is in fact true and that in certain redshift ranges, the 
LOWZ sample is more complete is terms of stellar mass than 
CMASS (see Section 6.2). 

In this paper, we use the internal DRIO BOSS data 
release (Ahn et al. 2014). However, our analysis is limited 
to the Stripe 82 region of the BOSS survey (described in 
Section 2.2) which was observed in the early phases of the 
BOSS survey. Because DRll and DR12 provide no new data 
in Stripe 82, our analysis and conclusion would be identical 
using these later data releases. The main samples and cata¬ 
logs that are used in this paper are: 

(i) The original catalog that was used to target 
BOSS galaxies. From this catalog, we extract the orig¬ 
inal set of LOWZ and CMASS targets. We will re- 
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fer to the original target samples as “TARJLOWZ” and 
“TAR_CMASS”. Galaxies that were part of the tar¬ 
get catalog, but did not obtain a fiber due to fiber- 
collisions are designated as “FIBER_COLLIDEDJLOWZ” 
and “FIBER_COLLIDED_CMASS”. 

(ii) The BOSS DRIO large-scale structure (LSS) cata¬ 
log created by the BOSS galaxy clustering working group 
that was used in the Anderson et al. (2014) analysis. We 
only give a brief description of this catalog and refer the 
reader to Anderson et al. (2014) for additional details. This 
catalog is constructed by trimming the BOSS redshift file 
(the “SpALL” file) by the BOSS mask. Legacy objects 
are added to the catalog (Legacy objects are not con¬ 
tained in the SpALL file). Finally, Legacy objects are down- 
sampled in each sector to match the BOSS completeness. 
The LSS catalog contains the flag “IMATCH” which in¬ 
dicates galaxies with BOSS redshifts (IMATCH= 1) and 
galaxies with Legacy redshifts (IMATCH= 2). Galaxies from 
the LSS catalogs will be referred to as “LSSJLOWZ” and 
“LSS_CMASS”^ 

(hi) The BOSS stellar mass catalog from the Portsmouth 
group (Maraston et al. 2013). 

Further details and the url’s of these catalogs are pro¬ 
vided in the Appendix. 

2.2 The Stripe 82 Massive Galaxy Catalog 

A key data set for this paper comes from observations of 
the Stripe 82 region along the celestial equator in the region 
of the southern galactic sky—a narrow, but deeper subset 
of the SDSS survey area—for which it is possible to con¬ 
struct a galaxy sample with a well understood complete¬ 
ness function. Stripe 82 is critical for this paper for two 
reasons. First, it was the subject of repeat imaging cam¬ 
paigns in SDSS, especially by the SDSS Supernova Survey 
(Frieman et al. 2008). These data have been combined into 
the “SDSS Coadd” by Annis et al. (2011) and reach roughly 
2 magnitudes deeper than the single epoch SDSS imaging 
with a 5cr detection limit of r 22.5. This added depth is 
critical for obtaining reliable photometric redshifts (photo- 
zs) for massive galaxies that can be used to supplement the 
color-selected BOSS samples out to 2 ~ 0.7. Second, this re¬ 
gion was also imaged by the UKIRT Infrared Deep Sky Sur¬ 
vey (UKIDSS, Lawrence et al. 2007) which provides near-IR 
photometry for ensuring robust stellar mass estimates. 

The Stripe 82 Massive Galaxy Catalog (s82-mgc) com¬ 
bines these data sets and delivers matched ugrizYJHK 
photometry using catalog-level synthetic aperture photom¬ 
etry (Bundy et al. 2012). Galaxies are separated from stars 
following Baldry et al. (2010) and via a combination of light 
profile and color cuts. Redshift information is provided by 
a number of sources including all SDSS spectroscopic red¬ 
shifts available in DRIO and several photometric redshift 
estimates. Among these are neural-network-based photo-zs 
from Reis et al. (2012) and iterative template-fitting esti¬ 
mates from the red-sequence Matched-filter Probabilistic 
Percolation {redMaPPer, Rykoff et al. 2014). The typical 
photo-z precision is az — 0.03 at 2 ^ 0.7. 

^ Note that we do not apply the large-scale systematic weights 
that are applied to the BOSS clustering analyses. 


A redshift estimator, 2beBt, is built for S82-MGC that em¬ 
ploys spectroscopic redshifts whenever possible and chooses 
among three photometric-based redshift estimates other¬ 
wise. In order of priority, 2best is set to: 

(i) 2spec: spectroscopy from SDSS-II or from BOSS. 

(ii) 2 a: for cluster members. 

(iii) 2red: for field early type galaxies. 

(iv) 2phot: photometric redshift from Reis et al. (2012). 

The scatter and bias in the photometric-based redshift 
estimators are characterized in Paper 1. 

Mangle (Swanson et al. 2008) and custom software is 
used to apply geometric masking of regions with imaging 
artifacts in the SDSS coadds and UKIDSS imaging and ar¬ 
eas around bright stars. The BOSS acceptance mask as well 
as rejection masks for collisions with the plate post and po¬ 
tentially higher priority BOSS targets are also applied^. 

Fiducial stellar mass estimates for the S82-MGC are de¬ 
scribed in Paper I and compared to other publicly available 
estimates from the BOSS survey. A more detailed examina¬ 
tion of M* estimators and potential biases is presented in Pa¬ 
per III. Briefly, the fiducial M* estimates are the result of the 
SED-fitting code described in Bundy et al. (2010) applied to 
the SDSS-I-UKIDSS PSE-matched photometry and the de¬ 
fined 2best- Stellar population templates are derived using 
BC03 models (Bruzual & Chariot 2003) and the STELIB 
empirical library (Le Borgne et al. 2003) and assuming a 
Chabrier (2003) Initial Mass Function (IMF). For a prior 
grid of SED templates spanning a range of ages (restricted 
to be less than the age of the universe at the redshift of 
the galaxy), metallicities, dust extinction, and exponential 
T star formation histories, a mass probability distribution is 
obtained by scaling the model M/L ratios by the inferred 
luminosity in the observed K-band. The median of this dis¬ 
tribution is taken as the stellar mass estimate. 

Starting from the S82-MGC parent catalog, we select the 
“UKWIDE” sub-sample (see Paper I) by rejecting sources 
in masked regions and requiring all classihed galaxies be 
observed on UKIDSS-LAS frames with 5a detection limits 
deeper than yji7A=[20.3,20.0,19.6,19.5] on the AB system 
of Oke & Gunn (1983). The final UK WIDE sample spans 
an area of 139.4 deg^. 


3 SAMPLE CHARACTERISTICS 

Figure 1 displays the stellar mass and redshift distribu¬ 
tions for the main galaxy samples considered in this pa¬ 
per compared to the full galaxy population from the s82- 
MGC. The S82-MGC catalog extends to lower mass limits 
compared to the BOSS samples and is mass complete to 
log]^g(M/MQ) = 11.2 at 2 = 0.7 (Paper I). 

For the LOWZ sample, because of a targeting error^ 
related to star-galaxy separation in the early phase of the 
BOSS survey, a cut of TILE> 10324 must be applied in order 
to select a uniform sample of LOWZ galaxies. However, the 
Stripe 82 region was unaffected by this error (see Figure 2 

® For details on BOSS masks, see: http://www.sdss3.org/drl0/ 
tutorials/lss_galaxy. 

^ See Parejko et al. 2013 and http://www.sdss3.org/dr9/ 
algorithins/boss_galaxy_ts. php 
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Figure 1. Mass and redshift distributions of several galaxy sam¬ 
ples considered in this paper (indicated by colored symbols). The 
underlying black points and grey contours show all galaxies from 
the S82-MGC as a function of mass and redshift. The S82-MGC cat¬ 
alog extends to lower mass limits compared to the BOSS spectro¬ 
scopic samples. 


in Parejko et al. 2013) and this cut is unnecessary for the 
S82-MGC. 

Figure 2 compares the number densities of various sam¬ 
ples (e.g., Legacy, LOWZ, CMASS) to the estimated number 
density of stellar mass threshold samples from the S82-MGC. 

Large imaging surveys that will overlap with the BOSS 
footprint such as the HSC and Euclid surveys will be able 
to build mass limited samples using a combination of spec¬ 
troscopic redshifts and photometric-based estimators using 
a strategy similar to the one we have used here. The ra¬ 
tio A^spec/A^phot represents a useful quantity when consid¬ 
ering trade-offs between the mass limits of such samples 
and errors introduced by supplementing spectroscopic sam¬ 
ples with photometric redshifts. Figure 3 presents the origin 
of redshifts that contribute to Zbest for three stellar mass 
thresholds. At z < 0.61, it is possible to construct mass- 
limited samples with logjQ(M/MQ) > 11.6 for which more 
than 80% of the sample has a spectroscopic redshift. At 
logiQ{M/M q) > 11.0, however, spectroscopic samples must 
be significantly supplemented by photometric redshifts, ex¬ 
cept at low redshifts {z < 0 . 1 ). 

Finally, another question of interest is to determine 
how faint a spectroscopic survey needs to reach in order 
to probe a certain mass limit at a given redshift. Figure 
4 presents the cumulative icmod magnitude distribution of 
stellar mass threshold samples as a function of redshift. This 
figure demonstrates that galaxy samples with icmod < 20 are 
roughly 90% complete for logj^g(M,) = 11.7 at z = 0.65. For 
comparison, CMASS includes a cut at icmod < 19.9 which 
impacts the completeness at higher redshifts. A more in- 
depth study of the impact of the CMASS cuts is provided 
in the following section. 



Redshift 


Figure 2. Number density as a function of redshift for sev¬ 
eral galaxy samples considered in this paper. The magenta line 
corresponds to LOWZ targets (“TARXOWZ”). The purple line 
corresponds to Legacy SDSS-II LRGs (“Legacy”). The blue 
line corresponds to redMaPPer cluster members (“RM_CLUS”, 
Rykoff et al. 2014). The green line corresponds to galaxies 
with Zred and Xred ^ ^ (“ZRED”). The solid red line cor¬ 
responds to CMASS targets (“TAR_CMASS”) and the dashed 
red line corresponds to CMASS galaxies from the LSS catalog 
(“LSS_CMASS”). Grey dashed lines indicate the number densi¬ 
ties of stellar mass threshold samples as a function of redshift 
estimated from the S82-MGC. 


4 EFFECTS OF TARGET SELECTION ON 

BOSS GALAXY SAMPLES 

We begin with a broad investigation of the impact of the 
BOSS target selection on the BOSS galaxy samples as 
a function of redshift, stellar mass, and color. To repre¬ 
sent galaxy colors we use an estimate of the birth pa¬ 
rameter, fciooo, which is the ratio of the average star for¬ 
mation rate within the previous 1 Gyr to the star for¬ 
mation rate averaged over the galaxy’s history. In this 
work, we use fciooo estimates from the KCORRECT pack¬ 
age (Blanton & Roweis 2007) which are computed from an 
SED fit of a linear combination of stellar population tem¬ 
plates from Bruzual & Chariot (2003) (hereafter, BC03). As 
for M* estimates, &iooo depends on the assumed models and 
priors used to the fit the observed SEDs. It is a rough es¬ 
timate of recent star formation that is based on more than 
a single optical color (such as g — r) and which takes ad¬ 
vantage of the added constraints on dust afforded by the 
near-IR photometry. 

In the S82-MGC, galaxies at z ^ 0.55 and M, ~ 11.5 
have biooo values as high as 0.7, suggesting an occasional 
high rate of recent star formation, but the vast majority 
are peaked near 61000 = 0 as expected for an old, passively 
evolving stellar population. Paper I explores in greater de¬ 
tail how 61000 better separates red-sequence galaxies with 
ongoing dusty star formation from those with truly passive 
populations. 

Figure 5 shows galaxies from the S82-MGC in the 
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Figure 3. Origin of redshifts that contribute to ^best a function of stellar mass. The solid red line indicates the fraction of galaxies 
with spectroscopic redshifts and with log]^Q(M*/M 0 ) > 11 (left panel), log]^Q(M+/MQ) > 11.4 (middle panel), and log]^Q(M*/M 0 ) > 11.6 
(right panel) as a function of redshift. At z < 0.61, it is possible to construct mass-limited samples with log;LQ(M/M 0 ) > 11.6 for which 
more than 80% of the sample has a spectroscopic redshift. At logj^Q(M/M 0 ) > 11.0, however, spectroscopic samples must be significantly 
supplemented by photometric redshifts, except at low redshifts (z < 0 . 1 ). 


z=[0.3,0.4] z=[0.4,0.5] z=[0.5,0.6] z=[0.6,0.7] 



Figure 4. Cumulative magnitude distribution of stellar-mass threshold samples as a function of redshift. The dashed horizontal lines 
shows the 90% completeness limit. The dashed vertical line corresponds to icmod = 20. One of the cuts that defines the CMASS samples 
is a magnitude cut at icmod < 19.9. This figure demonstrates, for example, that galaxy samples with icmod < 20 are 90% complete tor 
log,Q(Af*) = 11.7 at ^ = 0.65. 


CMASS redshift range as a function of icmod and dperp- Be¬ 
cause a number of galaxies in Figure 5 are not contained in 
the original BOSS target catalog®, icmod and dperp are taken 
from the deeper Stripe 82 co-add photometry (Annis et al. 
2011). For CMASS galaxies, the typical RMS scatter be¬ 
tween the co-add photometry and the shallower photometry 
of the target catalog is of order urms = 0.08 mag for dperp 
and CTRMS = 0.14 mag for icmod- 

Figure 5 clearly demonstrates that the CMASS sam¬ 
ple is only complete in terms of mass and color at the 
highest masses (log,g(M,/M 0 ) > 11.7) and in a narrow 
redshift window at z 0.6. The dperp cut mainly affects 
the sample selection at lower redshifts {z < 0.6). The 

® bosstarget-lrg-main007-collate.fits. See Appendix. 


flux limit affects the sample selection at low stellar masses 
(log^Q(M*/M 0 ) < 11.5) and at higher redshifts (z > 0.6) 
but is relatively unimportant for massive galaxies at low 
redshifts. Galaxies with high values of fciooo (suggesting re¬ 
cent star-formation) are excluded from the sample at low 
redshift due to a combination of all three cuts. At higher 
redshifts {z > 0 . 6 ), the sample is mainly flux limited and 
includes a larger range of galaxy colors at fixed magnitude. 

Figure 6 presents rcmod and cy in the LOWZ redshift 
range where rcmod and cy are taken from the S82-MGC. For 
LOWZ galaxies, the typical RMS scatter between the co-add 
photometry and the shallower photometry of the target cata¬ 
log is of order ctrms = 0.07 mag for cy and ctrms = 0.11 mag 
for rcmod. Figure 6 shows that the LOWZ sample is roughly 
complete in terms of mass and color at /Mq) > 
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Figure 5. Galaxies from the S82-MGC in the CMASS redshift range as a function of icmod a-nd ciperp- Galaxies are color coded according to 
their birth parameter 61000 ■ larger values of biooo (bluer colors) indicate galaxies that have experienced larger amounts of star formation 
in the past Gyr. Vertical black lines indicate the GMASS flux limits (equation 10). The horizontal line indicates the CMASS dperp cut 
(equation 8 ). Tilted black lines indicate the fiducial sliding cut (equation 9) as well as the sliding cut that defines the sparsely sampled 
region (equation 15). The vertical error bar indicates the typical scatter between the target photometry and the co-add photometry. 
The horizontal error bar indicates the typical scatter for rt-mod- The stellar mass completeness of CMASS is due to the fact that the 
intrinsic color distributions of galaxies at fixed mass often extend beyond the BOSS color boundaries, as well as to scatter across these 
color boundaries because of the shallower depth of the target photometry. The later effect is not present in this Figure which is based 
on co-add photometry. 


11.6 over the redshift range 0.15 < 2 : < 0.43. At lower stel¬ 
lar masses, the mass completeness is limited by the cy slid¬ 
ing cut. The flux limit mainly affects the sample selection 
at low stellar masses (logpo(M*/M 0 ) < 11.7) and at higher 
redshifts {z > 0.3). 


5 TOTAL STELLAR MASS FUNCTIONS 

To estimate the completeness of the BOSS samples, we first 
estimate the total stellar mass function in four redshift bins. 
For a given mass bin and redshift, the stellar mass complete¬ 
ness of any given spectroscopic sample, A, will be estimated 
via the ratio c = (jiA/tj^tot where (pA is the number density of 


sample A and 4>tot is the number density of the total stellar 
mass function. 

We construct redshift bins that correspond to a vol¬ 
ume of 0.05-0.06 Gpc®. For the LOWZ sample, this re¬ 
quires a single large redshift bin, zi = [0.15,0.43]. For 
CMASS, we divide the sample into three equal volume bins: 
22 = [0.43,0.54], 23 = [0.54,0.63], and 24 = [0.63,0.7]. The 
time span for these redshift bins is 2.6 Gyr, 0.8 Gyr, 0.6 
Gyr, and 0.4 Gyr, respectively. We take the redshift-binned 
mass functions to represent the galaxy distribution sampled 
at the midpoint of each bin. A more detailed investigation 
of the redshift evolution of the total SMF is presented and 
discussed in Paper III (Bundy et al. in prep). 

To construct the total SMF, we use 2 best and Mbest from 
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Figure 6 . Galaxies from the S82-MGC in the LOWZ redshift range as a function of rcmod s-nd C||. Galaxies are color coded according to 
their birth parameter 61000 ■ larger values of biooo (bluer colors) indicate galaxies that have experienced larger amounts of star formation 
in the past Gyr. Vertical black lines indicate the LOWZ flux limits (equation 6 ). The tilted black line represents the LOWZ sliding cut 
(equation 5). Black triangles indicate galaxies that are removed by the c± cut (equation 4). The vertical error bar indicates the typical 
scatter between the target photometry and the co-add photometry. The horizontal error bar indicates the typical scatter for 


the S82-MGC to measure the SMF at the high-mass end and 
data from PRIMUS (Moustakas et al. 2013) to evaluate the 
low-mass end. Because we use a parent sample that is com¬ 
plete above logMt/MQ = 11.2 for 2 < 0.7, Umax corrections 
are not required. Errors on the Stripe 82 SMFs are derived 
via bootstrap using 214 roughly equal-area bootstrap re¬ 
gions. 

Figure 7 shows mass functions from PRIMUS and from 
the S82-MGC over 0.43 < 2 < 0.7. The figure demonstrates 
two key points. First, it is clear that mass functions from 
PRIMUS and COSMOS are insufficient to constrain the 
SMF at masses above /Mq) = 11.5 — emphasizing 

the importance of the S82-MGC at these high masses. Sec¬ 
ond, as expected, the SMF of the S82-MGC UKWIDE sam¬ 
ple itself begins to turn over due to incompleteness around 
logiQ(M*/MQ) ~ 11.2 at 2 ~ 0.7. 

To obtain total SMFs for our subsequent analysis, we 


combine SMF measurements from the PRIMUS survey at 
log]^Q(M*/M 0 ) < 11.3 with those from the S82-MGC at 
logiQ(M*/M 0 ) > 11.3. For PRIMUS we use the mass func¬ 
tions estimated by Moustakas et al. (2013) in the redshift 
range closest to our bin®. The PRIMUS mass functions as¬ 
sume a Chabrier IMF, BC03 stellar population models, and 
have been adjusted to match our fiducial cosmology. Fig¬ 
ure 8 presents the total SMF in our four redshift bins from 
2 = 0.15 to 2 = 0.7. 


® Small offsets (~0.1 dex) are likely between the PRIMUS stellar 
masses and the masses from the S82-MGC. However, PRIMUS did 
not compute stellar masses for their Stripe 82 field due to a lack of 
Spitzer data. Hence, we cannot directly compare mass estimates 
between PRIMUS and the S82-MGC. However, we expect that any 
offsets will only have a minor impact on completeness estimates 
and only for logj^Q(AL*/M 0 ) < 11.3. 


MNRAS 000, 000-000 (0000) 



















































Stellar Mass Completeness of BOSS 9 


10^ r 


. 


10- r 


10 - r 


10- r 


-1—1—1—1—[- 


A S82-MGC 0.43<z<0.7 
O PRIMUS 0.4<z<0.5 
A PRIMUS 0.5<z<0.65 
O PRIMUS 0.65<z<0.8 


H 


10.8 11.0 


11.2 11.4 11.6 

Log,„(M. [Mel) 


11.8 


12.0 


CMt-zp = 0) as a function of mass and redshift is identical 
to the S82-MGC. 

The results of fitting our mock samples to the observed 
SMF data are presented in Figure 8 . Because a large frac¬ 
tion of galaxies at the high-mass end have a spectroscopic 
redshift, the inclusion of galaxies with photometric redshifts 
only has a very minor impact on the SMF. The effects of 
CMt-zp are hence negligible for the S82-MGC. The best fit 
values for the double Schecter fits are given in Table 1. In 
this exercise, we have not accounted for additional sources of 
scatter in M*, namely from the mass estimates themselves 
or from uncertainties in total luminosities. Because these 
latter sources of errors are also present in the spectroscopic 
samples, they cancel in our completeness functions which 
divide spec-z SMFs by those supplemented with photo-zs. 


Figure 7. Total stellar mass function derived using z:best for 
0.43 < z < 0.7. Diamonds show the stellar mass functions from 
PRIMUS in three redshift bins from z = 0.4 to 2 = 0.8. Our 
Stripe 82 sample tightly constrains the high-mass end of the stel¬ 
lar mass function (logpg(M*/MQ) > 11.5) while PRIMUS con¬ 
strains the low mass end. The Stripe 82 sample is mass-complete 
to logiQ{Mf /Mq) = 11.2 at 2 = 0.7. 


The S82-MGC 2best parameter uses a combination of 
spectroscopic redshifts and photometric redshifts. Because 
redshift errors translate into an error on the luminosity dis¬ 
tance, galaxies that do not have a spectroscopic redshift 
will have an extra error term for Mbeat which we denote as 
CMt-zp. In the S82-MGC, this error term ranges from about 
frM*-zp = 0.05 dex to crM*-zp =0.1 dex. One concern with 
our approach is that the inclusion of galaxies with photo¬ 
metric redshifts will cause an increase (due to Eddington 
bias) in the amplitude of the steep, high-mass end of the to¬ 
tal SMF compared to a scenario in which all galaxies in our 
sample had a spectroscopic redshift. This behavior would 
lead our completeness estimates to be underestimated be¬ 
cause <j)tot would be artificially inflated by this additional 
scatter term relative to the number densities obtained for a 
spectroscopic sample. 

We account for this effect by forward modeling the 
SMF by convolving for M* scatter induced by photometric 
redshift uncertainties. We assume an input functional form 
for the total SMF that follows a double Schechter function 
(Baldry et al. 2008): 




(In 10 ) exp 


' MT 


X 


|j^ll0(“l+l)('°S A4,-log Mo) 0 ^^Q(c« 2 -t-l){logM,-log Mo) | 

(16) 

where 02 > a\ and the second term dominates at the low- 
mass end. We generate Monte Carlo realizations of this 
function that sample various ranges for its parameters. The 
values of 4>i, (f )2 and Mo are allowed to vary, while we fix 
ai = —0.46 and 02 = —1.58. A mock sample is drawn from 
each realization of the input SMF and additional scatter, 
(TMt-zp, is added to the mock stellar masses of this sam¬ 
ple following the estimated (tm»-zp(A/*, z ) distribution from 
the S82-MGC. In our Monte Carlo mock realizations, the 
fraction of galaxies that have a spectroscopic redshift (i.e. 


6 STELLAR MASS COMPLETENESS OF BOSS 

SAMPLES 

6.1 Stellar Mass Completeness of CMASS 

With the total SMF now in hand, we derive the stellar mass 
completeness for the LSS_CMASS sample. Completeness is 
estimated by comparing the total number of LSS_CMASS 
galaxies in a given redshift and stellar mass bin to that de¬ 
rived from the total SMF. For the total SMF, we use the 
three redshift bins 22 , 2 : 3 , and 24 . Figure 9 presents our to¬ 
tal SMF compared to target CMASS galaxies, fiber collided 
galaxies, CMASS galaxies from the LSS catalog, and Legacy 
objects, in our three fiducial redshift bins. 

We further sub-divide each of these redshift bins into 
two roughly equal volume bins to compute the complete¬ 
ness of CMASS in a total of six redshift bins. The CMASS 
SMFs for each of these six redshift bins are shown in Figure 
10. If the number of LSS_CMASS galaxies fluctuates above 
the prediction based on the total SMF, the completeness is 
simply set to unity. The results are presented in Figure 11 
and the completeness values are given in Table 2. 

For convenience, we also fit the completeness with the 
following functional form: 


c = 


/ 

2 




(17) 


with free parameters /, cr, and Mi. The results are shown 
on the right hand side of Figure 11 and the values of the 
fitted parameters are given in Table 3. 

Figure 11 demonstrates that CMASS is 80% complete 
at logjQ(M*/M 0 ) > 11.6 in the narrow redshift range 2 = 
[0.51, 0.61]. At the mean redshift of the CMASS sample, 2 = 
0.55, CMASS is roughly 80% complete at logjo(M*/M 0 ) = 
11.4. For comparison, Maraston et al. (2013) conclude that 
BOSS is complete above logiolM,/M q) = 11.3 at 2 < 0.6 
and at ~ 11-6 at 2 > 0.6. This work nar¬ 

rows this statement to 2 = [0.51, 0.61] and shows that the 
completeness decreases at lower and higher redshifts. Based 
on these considerations, referring to this sample in terms of 
“constant mass” should only be considered in loose terms. 
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Figure 8. Total SMF in four redshift bins from 2 : = 0.15 to z = 0.7. To construct the total SMF, we combine data from the PRIMUS 
survey at logjo(M*/M 0 ) < 11.3 with data from the S82-MGC at logjo(M*/M 0 ) > 11.3. Solid lines indicate our fit to the total SMF 
and dotted lines show the total SMF deconvolved for the effects of scatter introduced by photometric redshift. Because a large fraction 
of galaxies at the high-mass end have a spectroscopic redshift (>80% at the high mass end), the inclusion of galaxies with photometric 
redshifts has a negligible impact on the SMF. 


Table 1. Parameters of the double Schecter fit to the total SMF as a function of redshift. Only the first three parameters are varied in 
the fit. These parameters correspond to the SMF after deconvolving for the effects of scatter due to the inclusion of a sub-set of galaxies 
with photometric redshifts. 


Redshift 

loglo(</'l/Mpc 3dex 1 ) 

logic(</' 2 /Mpc ^dex 1 ) 

logic ( 11 ^ 0 /A^e) 

ai 

a2 

Z1 = [0.15,0.43] 

-2.97 

-2.79 

10.910 

-0.46 

-1.58 

Z 2 = [0.43, 0.54] 

-2.95 

-2.89 

10.922 

-0.46 

-1.58 

Z 3 = [0.54,0.63] 

-3.06 

-2.91 

10.986 

-0.46 

-1.58 

Z 4 = [0.63,0.7] 

-3.06 

-2.92 

10.995 

-0.46 

-1.58 


Table 2. Measured stellar mass completeness of the CMASS sample. Bins with less than 10 galaxies are marked with an ’x’ symbol. 
The bottom line corresponds to the completeness estimates for the combined LOW-hCMASS sample at 0.38 < z < 0.48. 


logioCA^t/Ma) logiQ(M*/MQ) logiQ(M*/M0) log,o(M*/MQ) logio(M,/M0) logio(M*/M0) logio(M,/M0) 

= 11.09 =11.21 =11.34 =11.46 =11.59 =11.71 =11.84 


0.46 

0.07 

± 

0.01 

0.24 

± 

0.01 

0.45 

± 

0.03 

0.51 

± 

0.04 

0.57 

± 

0.06 

0.65 

± 

0.11 


X 


0.51 

0.08 

± 

0.00 

0.30 

± 

0.01 

0.66 

± 

0.03 

0.85 

± 

0.05 

0.90 

± 

0.07 

1.00 

± 

0.17 


X 


0.56 

0.03 

± 

0.00 

0.15 

± 

0.01 

0.50 

± 

0.02 

0.82 

± 

0.04 

0.85 

± 

0.06 

0.83 

± 

0.10 

1.00 

± 

0.23 

0.61 

0.02 

± 

0.00 

0.06 

± 

0.01 

0.28 

± 

0.02 

0.58 

± 

0.03 

0.92 

± 

0.06 

1.00 

± 

0.12 

1.00 

± 

0.28 

0.65 

0.01 

± 

0.00 

0.02 

± 

0.00 

0.10 

± 

0.01 

0.29 

± 

0.02 

0.51 

± 

0.04 

0.95 

± 

0.09 

0.97 

± 

0.24 

0.69 

0.00 

± 

0.00 

0.02 

± 

0.00 

0.04 

± 

0.01 

0.16 

± 

0.01 

0.46 

± 

0.05 

0.89 

± 

0.11 

1.00 

± 

0.39 

comb 

0.05 

± 

0.00 

0.16 

± 

0.01 

0.42 

± 

0.02 

0.68 

± 

0.03 

0.92 

± 

0.06 

1.00 

± 

0.13 

1.00 

± 

0.41 
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Zj=[0.43,0.54] Z3=[0.54,0.63] Z4=[0.63,0.7] 



Figure 9. Total SMF compared to target CMASS galaxies (red stars), fiber collided galaxies (orange triangles), CMASS galaxies from 
the large-scale structure catalog (green plus symbols), and Legacy objects (blue diamonds) in three different redshift bins. We sub-divide 
each of these redshift bins into two roughly equal volume bins to compute the completeness of CMASS in a total of six redshift bins. At 
low redshifts, legacy objects make up 50% of galaxies at the high mass end. 


Z2=[0.43,0.54] Z3=[0.54,0.63] Z4=[0.63,0.70] 



Figure 10. Stellar mass functions of the LSS_CMASS sample in six redshift bins compared to the total SMF. Completeness is estimated 
via the ratio c = 0 a /0tot where 0 a is the number density of CMASS in a given redshift bin and 0tot is the number density of the total 
stellar mass function. 


Table 3. Functional fits for completeness of the LSS_CMASS 
sample. The bottom line corresponds to the combined 
LOWZ-I-CMASS sample. 


Redshift / 

O- log4Q(ALi/MQ) 

0.46 

0.57 

0.20 

11.24 

0.51 

0.95 

0.20 

11.28 

0.56 

1.00 

0.22 

11.36 

0.61 

1.00 

0.22 

11.44 

0.65 

1.00 

0.27 

11.57 

0.69 

1.00 

0.26 

11.64 

comb 

1.00 

0.25 

11.38 


6.2 Stellar Mass Completeness of LOWZ 

We now proceed in a similar manner for the LOWZ sample 
in the redshift range 0.15 < 2 : < 0.43. For LOWZ, we use 
the total SMF estimated in the the zi — [0.15, 0.43] redshift 
bin and shown in Figure 8. Figure 12 presents the SMFs of 
the LOWZ sample compared to the total SMF as a function 
of redshift. In a similar fashion as in the previous section, 
completeness is estimated by comparing the total number of 
LSS_LOWZ galaxies in a given redshift and stellar mass bin 
compared to the expectation derived from the total SMF. 
The results are displayed in Figure 13 and the completeness 
values are listed in Table 4. The values of the functional fits 
to the completeness are given in Table 5. 

The errors on the LOW completeness are large at the 
high mass end because the sub-volumes used to measure the 
redshift dependance of the completeness are relatively small. 
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Figure 11. Left: stellar mass completeness of the CM ASS sample as a function of stellar mass and redshift. For clarity, errors are shown 
only for the 2 : = 0.61 redshift bin. Right: fits to the completeness using Equation 17. 


Table 4. Measured stellar mass completeness of the LOWZ sample. 


z 

loglo(^*/A4o) 

=11.09 

logio(A4*/M0) 

=11.21 

logic (A4*/M0) 

= 11.34 

logic (A/'*/M0) 

=11.46 

logic (A 4 ./M 0 ) 
=11.59 

logic (44* /Mq) 
=11.71 

0.18 

0.10 ±0.01 

0.69 ± 0.04 

1.00 ±0.07 

1.00 ±0.10 

0.96 ±0.15 

X 

0.23 

0.04 ±0.01 

0.39 ±0.03 

0.89 ± 0.07 

1.00 ±0.10 

1.00 ±0.16 

1.00 ±0.35 

0.29 

0.05 ±0.01 

0.29 ±0.02 

0.82 ± 0.05 

0.87 ± 0.07 

1.00 ±0.12 

1.00 ±0.26 

0.35 

0.09 ±0.01 

0.29 ±0.02 

0.62 ± 0.04 

0.82 ± 0.06 

0.76 ±0.09 

0.94 ±0.18 

0.40 

0.01 ±0.00 

0.10 ±0.01 

0.46 ±0.03 

0.79 ± 0.05 

0.98 ±0.10 

1.00 ±0.19 



Login (M. [M©]) 


Figure 12. Stellar mass functions of the LSS_LOWZ sample in 
five redshift bins compared to the total SMF. Completeness is 
estimated via the ratio c = 0 A/ 0 tot, where is the number 
density of LOWZ in a given redshift bin and (ptot is the number 
density of the total stellar mass function. In this redshift bin, our 
double Schecter fit slightly under-estimates the total SMF at the 
high-mass end at logiQ(M*/M 0 ) ~ 11 . 8 . 


Table 5. Functional fits for completeness of the LSS_LOWZ sam¬ 
ple. 


Redshift 

/ 

(7 

logic(Afi/M 0 ) 

0.18 

1.00 

0.11 

11.18 

0.23 

1.00 

0.12 

11.24 

0.29 

1.00 

0.16 

11.27 

0.35 

0.87 

0.20 

11.27 

0.40 

1.00 

0.17 

11.36 


As a result, for example, the best-fitting functional form 
does not converge to unity at large masses in the z = 0.35 bin 
but does in the z = 0.29 and 0.40 redshift bins. The values 
provided in this paper should be considered as estimates 
with relatively large errors at the high mass end. Studies 
that are sensitive to these completeness estimates must take 
the reported errors in Table 4 into consideration. 

Figure 13 demonstrates that LOWZ is at least 80% 
complete at logj^g(M*/M 0 ) > 11.6 over the entire redshift 
range and at least 90% complete at log]^Q(M*/M 0 ) > 11.5 
in the redshift range z = [0.18,0.29]. At the mean redshift 
of the LOWZ sample, z = 0.29, LOWZ is roughly 80% 
complete at logjQ(M*/M 0 ) = 11.4. Interestingly, these re¬ 
sults counter the conventional wisdom that CMASS is more 
complete at higher stellar masses compared to LOWZ {e.g., 
Anderson et al. 2014). 

Finally, although we do find a high stellar mass com- 


MNRAS 000, 000-000 (0000) 

























Stellar Mass Completeness of BOSS 13 



11.0 11.2 11.4 11.6 11.8 12.0 11.0 11.2 11.4 11.6 11.8 12.0 

Log,o(M* [MoD Log,o(M. [Mq]) 


Figure 13. Left: stellar mass completeness of the LOWZ sample as a function of stellar mass and redshift. For clarity, errors are shown 
only for the z = 0.29 redshift bin. Right: fits to the completeness using Equation 17. 


pleteness for LOWZ, Hoshino et al. (2015) reported that the 
fcmod > 16 cut in equation 6 removes a small fraction (~5%) 
of Brightest Cluster Galaxies from the nominal LOWZ sam¬ 
ple. 

6.3 Combined Sample 

As can be seen from Figure 2, CMASS and LOWZ overlap 
in the redshift range 2 0.38 to z ^ 0.48. At these red- 

shifts, it may be useful for certain studies to combine the 
two samples together. The stellar mass completeness of the 
combined LOWZ and CMASS samples in the redshift range 
0.38 < 2 < 0.48 are presented in Figure 14. Legacy objects 
are also included in this combined sample. The combined 
sample is roughly 80% complete to log]^Q(M*/M 0 ) = 11 . 6 . 
The completeness values for the combined samples are ap¬ 
pended to the bottom of Table 2 and Table 3. 

In conclusion, the combination of LOW and CMASS 
(and Legacy objects) is 80% complete to log^Q(M*/MQ) = 
11.6 at 2 < 0.61. 


7 SAMPLES FROM PREVIOUS STUDIES 
USING BOSS DATA 

In this section, we evaluate the potential impact of stel¬ 
lar mass incompleteness for specific samples that have been 
used in previous BOSS analyses. We focus on the sam¬ 
ples used in Maraston et al. (2013), Shankar et al. (2014), 
Miyatake et al. (2013), and More et al. (2014). 

7.1 CMASS Mass Functions from Maraston et al. 
(2013) 

Maraston et al. (2013) computed stellar masses for BOSS 
galaxies using SED fits to the SDSS single epoch 
ugnz photometry. SED fits were performed using the 
HYERSPECZ code (Bolzonella et al. 2000) using the 
Maraston & Stromback (2011) stellar population libraries 
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Log,o(M. [M®]) 


Figure 14. Stellar mass completeness of the combined 
LOWZ-I-CMASS sample in the redshift range 0.38 < 2 < 0.48. 
Legacy objects are included in this combined sample. The solid 
line corresponds to the best-fit using Equation 17. The combined 
sample is roughly 80% complete to log]^g(M*/AL 0 ) = 11 . 6 . 


and assuming a Kroupa IMF (Kroupa 2001). Masses were 
computed for all BOSS galaxies in two separate runs: one 
using a passive template and one using a suite of star¬ 
forming templates with exponentially declining or truncated 
star-formation histories. The stellar masses of star-forming 
galaxies were adjusted upwards by 0.25 dex to account for a 
bias when fitting star forming galaxies in which the best-fit 
model may underestimate the total stellar mass because of 
fitting the brightest population (Maraston et al. 2010). This 
bias was identified using mock catalogs. For both templates, 
masses with and without mass loss were computed in or¬ 
der to allow comparisons with the literature. These stellar 
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masses are a standard output of the BOSS pipeline and are 
publicly released. 

The two separate runs were combined into a single stel¬ 
lar mass catalog by adopting the star-forming templates for 
BOSS galaxies with apparent g — i colors less than 2.35 and 
the passive template for galaxies with g — i > 2.35. This 
matched the empirical morphological mix determined from 
the COSMOS field by Masters et al. (2011) 

Maraston et al. (2013) computed the mass function for 
the CMASS sample in three redshift bins from z = 0.45 
to z = 0.7 with the aim at constraining the assembly 
of the most massive galaxies in relation to galaxy forma¬ 
tion models. Because the stellar mass completeness of the 
CMASS sample was not known at that time, Maraston et al. 
(2013) did not apply any completeness corrections to these 
CMASS SMF, opting instead to apply the BOSS selection 
cuts to semi-analytic models (SAMs) when comparing with 
the theoretical mass function from galaxy formation mod¬ 
els. Maraston et al. (2013) find a deficit of massive galaxies 
(log(M*/M©) > 11.3 for a Kroupa IMF) between the BOSS 
data and SAMs over the redshift range 0.45-0.6. 

Paper I presents an in-depth comparison between the 
mass estimates from Maraston et al. (2013) and those from 
the S82-MGC. For our three fiducial redshift bins, there is a 
mean offset between the two stellar mass estimates, denoted 
S = logj^g(Mf®^“^°'^) — logj^Q(M^^^®). In the range 11.3 < 
logiQ(Mf®^“''^^'^) < 11.6, <5 = 0.11 dex for Z 2 , S = 0.14 dex 
for 23 , 5 = 0.10 dex for 24 . 

With a better understanding of the CMASS sample now 
in hand, we re-investigate the mass completeness of these 
CMASS SMFs from this early BOSS analysis. Figure 15 
presents a comparison between the total SMF derived in this 
paper with the CMASS SMFs derived in Maraston et al. 
(2013) (adjusted to our fiducial cosmology). Green data 
points indicate the Maraston et al. (2013) mass functions af¬ 
ter applying the mean offsets for each redshift bin. We find a 
lower amplitude in our total SMF at the high-mass end com¬ 
pared to Maraston et al. (2013). One possible explanation 
for this difference is that the Maraston et al. (2013) mass 
estimates may have a larger scatter compared to the S82- 
MGC, which are based on deeper optical and NIR photom¬ 
etry. A larger mass error would cause the Maraston et al. 
(2013) SMFs to be inflated at the high-mass end relative to 
those computed from the S82-MGC (due to Eddington bias). 

By qualitatively comparing the CMASS mass func¬ 
tions with previous analyses, Maraston et al. (2013) pro¬ 
vided a first estimate of the completeness of the CMASS 
sample, tentatively concluding a rough mass completeness 
of log^Q(M*/M©) = 11.3 at 2 < 0.6 and at logj^g(M*/M 0 ) = 
11.6 at 2 > 0.6. The analysis presented here suggests that 
these original estimates were reasonable, although somewhat 
optimistic. A more accurate estimate is presented in Table 
2 . 

7.2 Halo Occupation Modeling from 
Shankar et al. (2014) 

Shankar et al. (2014) recently studied the high-mass slope 
and scatter of the stellar-to-halo mass relation to 2 = 1 us¬ 
ing a variety of different data sets. One component of their 
analysis is based on Halo Occupation Distribution (HOD, 
Berlind & Weinberg 2002, and references therein) modeling 


of the projected two-point correlation function of CMASS 
galaxies. This clustering analysis was performed in two red¬ 
shift bins: 0.4 < 2 < 0.6 and 0.6 < 2 < 0.8. A stellar mass 
cut of logiQ{Mt:/M q) > 11.5 was applied to both samples. 
The Maraston et al. (2013) passive template mass catalog 
was adopted for this analysis (Hong Guo, priv comm)^. 

One underlying assumption of the HOD-type model em¬ 
ployed by Shankar et al. (2014) is that stellar-mass selected 
threshold samples should be mass complete. In this case, the 
central occupation function of central galaxies is generally 
expected to be well described by a traditional erf function 
that converges to unity at large masses. If the samples under 
consideration are incomplete in mass, then the amplitude 
and form of the central occupation function is uncertain. 

With this possibility in mind, we evaluate the mass 
completeness of the two samples employed by Shankar et al. 
(2014) using the same methodology as in Section 6.1. To esti¬ 
mate the completeness, we recompute the total SMF within 
each of the two Shankar et al. (2014) redshift bins and show 
our results in Figure 16. 

There are two aspects of this analysis worth empha¬ 
sizing. First, in practice, all stellar mass estimates are im¬ 
perfect and will have scatter relative to the true underly¬ 
ing stellar mass. Hence, any stellar mass “threshold” yields 
a sample with a more smoothly varying selection function 
on the true M* distribution. We can roughly estimate the 
magnitude of this effect by assuming that the scatter be¬ 
tween the Maraston et al. (2013) passive template masses 
and the S82-MGG masses is similar to the scatter between 
either estimate and the true underlying stellar mass. This 
effect will be captured if we use the actual Shankar et al. 
(2014) sample (i.e, defined by a selection with respect to the 
Maraston et al. 2013 masses) but evaluate the completeness 
using S82-MGC masses. This is shown in the left hand side 
of Figure 16. The samples no longer have a sharp boundary 
in mass but instead follow a more smoothly varying com¬ 
pleteness function. 

Second, we can evaluate the completeness of a threshold 
cut on M* defined for one set of M* estimates by trans¬ 
lating those thresholds into M* used in this work. The 
Shankar et al. (2014) sample, for example, was selected as 
logj^Q(M^^^®“^'‘'*®) > 11.5. Comparing to the Maraston et al. 
(2013) passive template masses used in that work, we find a 
mean offset® of 5 = —0.06 dex and S = —0.06 for the low and 
high redshift bins respectively compared to our masses from 
S82-MGC. Accounting for this mean offset, the fixed mass cut 
of log]^g(M.J^^®~*’^*’*’) > 11.5 used by Shankar et al. (2014) 
corresponds to cuts of logj^g(M*) > 11.43 (low —2 bin) and 
logjQ(M*) > 11.44 (high -2 bin) using our M* estimates. At 
these limits, CMASS is 75% complete for the low-redshift 
sample and only 15% in the high-redshift sample (middle 
panel of Figure 16). 

Finally, the right hand panel of Figure 16 displays the 
difference between the Maraston et al. (2013) passive tem- 


^ As opposed to creating a merged catalog using both the passive 
and the star-forming templates as in Maraston et al. (2013). 

® Mass offsets will differ from those quoted in Section 7.1 because 
in one case a merged (star-forming plus passive) catalog was cre¬ 
ated, and in the other the passive template was adopted for all 
galaxies. 
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Figure 15. Comparison between the total SMF derived in this work and the CMASS-only SMFs presented in Maraston et al. (2013). 
Green data points represent the Maraston et al. (2013) SMFs after applying our estimate of the mean offset between the Maraston et al. 
(2013) masses and S82-MGC masses for each redshift bin (these offsets are of order 0.1-0.15 dex). The Maraston et al. (2013) SMFs are 
computed for CMASS galaxies and do not include any completeness correction factors. 


plate masses and the s82-mgc masses at these scales. Both 
the offset and relative scatter are apparent. 

This exercise highlights the stellar mass incompleteness 
of these samples and demonstrates that that caution needs 
to be taken when studying CMASS. Understanding to what 
extend this may or may not have an impact on the original 
Shankar et al. (2014) analysis is beyond the scope of this 
paper. 


7.3 Cosmological Analysis from Miyatake et al. 

(2013) and More et al. (2014) 

In two companion papers, Miyatake et al. (2013) and 
More et al. (2014) present a joint analysis of the abundance, 
clustering, and galaxy-galaxy lensing signal measured for 
CMASS sub-samples. Using a HOD framework, they derive 
constraints on the high-mass end of the stellar-to-halo mass 
relation and on the cosmological parameters flrn and ns. 
Miyatake et al. (2013) and More et al. (2014) consider three 
CMASS samples selected in the range 0.47 < z < 0.59: 
“Sample A” with log]^g(M*/M 0 ) > 11.1 (“cut 1”), “Sam¬ 
ple B” with /Mq) > 11.3 (“cut 2”), and “Sample 

C” with \og^Q{Mt,/M q) > 11.4 (“cut 3”). The fiducial sam¬ 
ple in the More et al. (2014) cosmological analysis is the A 
sample. Stellar masses are taken from the Maraston et al. 
(2013) passive-template catalog and assume a Kroupa IMF. 

Using the same methodology as in the previous section, 
we compute the mass completeness for these three samples. 
The results are presented in Figure 17. Sample A is 30% 
complete at cut 1, sample B is 62% complete at cut 2 and 
sample C is 72% complete at cut 3. The right hand panel 
shows the incompleteness function of these samples evalu¬ 
ated using the original cuts. Again, the incompleteness of 
the two samples relative to the true underlying mass prob¬ 
ably lies between the left and the middle panel of Figure 
17. 

The More et al. (2014) HOD analysis accounts for po¬ 
tential incompleteness in the selection of CMASS galaxies 
at the low-mass end when compared to a true stellar mass 
threshold sample. However, the difficulty when working with 
incomplete samples is that galaxies that are excluded from 


the sample are not a random population. For example, the 
More et al. (2014) incompleteness model assumes that the 
CMASS selection corresponds to a random selection at fixed 
stellar and halo mass, i.e., it assumes that the galaxies that 
are removed from the sample live in similar halo environ¬ 
ments as CMASS galaxies at fixed stellar mass. However, 
as shown in Figure 5, CMASS is color selected and prefer¬ 
entially selects certain regions of color space at fixed stel¬ 
lar mass. In our companion paper, we demonstrate that a 
model which accounts for the stellar mass completeness of 
the CMASS sample, and reproduces the SMF and the two 
point correlation functions, but assumes that color in un¬ 
correlated with environment at fixed stellar mass, fails to 
reproduce the monopole and the quadrupole of the correla¬ 
tion function (Saito et al. in preparation). Hence, the mod¬ 
eling of incomplete and color-selected samples may not be 
straight-forward. 

Again, a full evaluation of the impact of incompleteness 
on the conclusions of Miyatake et al. (2013) and More et al. 
(2014) is beyond the scope of this paper. The better charac¬ 
terization of the mass and color completeness of the BOSS 
samples presented here will make such evaluations possible 
and improve future attempts to accurately model the BOSS 
samples. 


8 SUMMARY AND CONCLUSIONS 

The BOSS survey has collected spectra for over one mil¬ 
lion galaxies at 0.15 < 2 < 0.7 over a volume of 15.3 
Gpc® (9,376 deg^) which provides an opportunity to study 
the most massive galaxy populations with vanishing sam¬ 
ple variance. However, the BOSS sample selections involve 
complex color cuts which are not necessarily optimized for 
galaxy science. As a result, the selection function and stellar 
mass completeness of these samples are poorly understood. 
Nonetheless, given the large volumes and consequently large 
sample sizes at play, these surveys have a tremendous po¬ 
tential to constrain the galaxy-halo connection and to inves¬ 
tigate the most massive galaxies in the universe providing 
that the samples are well understood. 
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Figure 16. Stellar mass completeness of the CMASS samples used in the Shankar et al. (2014) analysis. Red diamonds correspond to 
the low-redshift sample and blue diamonds indicate the high-redshift sample. The x-axis represents the mass estimate from the S82-MGC. 
Left panel: mass completeness evaluated using the actual Shankar et al. (2014) sample but relative to S82-MGC masses. This panel shows 
additional spread compared to a pure threshold sample due to scatter between the Maraston et al. (2013) passive template masses and 
the S82-MGC masses. Middle panel: mass completeness evaluated using a fixed S82-MGC stellar mass cut. Right panel: distribution of 
mass differences between the Maraston et al. (2013) passive template masses and the S82-MGC masses. Vertical lines indicate the 50th 
percentile of the distribution. A mass cut of log]^g(M^^^~^“'’) = 11.5 corresponds to a mass cut of log]^g(A4*) ~ 11.43 in the S82-MGG. 


jy[ M13 pass sgjggfjoj, S82-MGC selection Mean mass difference 



Figure 17. Stellar mass completeness of the CMASS samples employed by Miyatake et al. (2013) and More et al. (2014). The x-axis 
represents the mass estimate from the S82-MGC. Left panel: mass completeness evaluated using the Miyatake et al. (2013) and More et al. 
(2014) samples but relative to S82-MGC masses. This panel shows additional spread compared to a pure threshold sample due to scatter 
between the Maraston et al. (2013) passive template masses and the S82-MGC masses. Middle panel: mass completeness evaluated using 
a fixed S82-MGC stellar mass cut. Right panel: distribution of mass differences between the Maraston et al. (2013) passive template 
masses and the S82-MGC masses. Vertical lines indicate the 50th percentile of the distribution. A mass cut of log]^g(M.^^^ pass^ _ ^ 

corresponds to a mass cut of logjQ(Mt) ~ 11.2 in the S82-MGG. 


In this paper, we characterize the stellar mass complete¬ 
ness of the BOSS samples with the goal of enabling future 
studies to better utilize these samples for galaxy-formation 
science. We use data from Stripe 82, which is roughly 2 mag¬ 
nitudes deeper than the single epoch SDSS imaging, and 
construct a catalog of massive galaxies that is complete to 
logj^Q(M*/MQ) > 11.2 at 2 = 0.7. Using this catalog, we 
empirically derive the stellar mass completeness of the two 
main BOSS spectroscopic samples: the LOWZ sample at 
0.15 < 2 < 0.43 and the CMASS sample at 0.43 < 2 < 0.7. 
We provide convenient fitting formulas which can be used 
to estimate the completeness of each of these samples as a 
function of stellar mass and redshift. 

We demonstrate that CMASS is significantly im¬ 


pacted by mass incompleteness and is 80% complete at 
log]^g(M*/M 0 ) > 11.6 only in the narrow redshift range 2 = 
[0.51, 0.61]. At the mean redshift of the CMASS sample, 2 = 
0.55, CMASS is roughly 80% complete at logjQ(M*/MQ) = 
11.4. Based on these considerations, referring to this sam¬ 
ple in terms of “constant mass” should only be considered 
in loose terms. In contrast, we demonstrate that LOWZ is 
80% complete at logiQ{M,/MQ) > 11.6 over the entire red¬ 
shift range and 90% complete at log]^g(M*/MQ) > 11.5 in 
the redshift range 2 = [0.18,0.29]. At the mean redshift of 
the LOWZ sample, 2 = 0.29, LOWZ is 80% complete at 
logiQ{Mt,/M q) — 11.4. Hence, our results counter the con¬ 
ventional notion that CMASS is more complete at higher 
stellar masses compared to LOWZ (Anderson et al. 2014). 
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Our results suggest an interesting redshift window for 
studying the evolution of the most massive galaxies. The 
combination of LOW and CMASS yields a spectroscopic 
sample that is 80% complete at logjQ(M*/MQ) > 11.6 at 
z < 0.61. 

The values provided in this paper should be considered 
as estimates. As can be seen in Figures 11 and 13, the errors 
on the completeness at high stellar mass are large due to the 
limited volume of Stripe 82. Upcoming wide area surveys will 
be able to repeat this analysis with much higher precision 
using several hundred to thousands of square degrees. The 
values presented here are reported with respect to S82-MCG 
masses so offsets may need to be applied to translate these 
completeness values to other mass estimates - see Paper I 
for details. 

With upcoming surveys in mind, such as the HSC and 
Euclid surveys, which will overlap with the BOSS footprint, 
we also characterize how many supplementary galaxies with 
photometric redshifts will be needed at any given stellar 
mass and redshift bin in order to construct mass limited 
samples. A sample that is mass limited to logiQ{M/M q) > 
11.6 can be constructed at z < 0.61 by supplementing the 
BOSS samples with photometric redshifts at the ~20% level. 
At logjQ(M/MQ) > 11.0, however, spectroscopic samples 
need to be significantly supplemented by photometric red¬ 
shifts (at the 80% level). 

We use our methodology to evaluate the stellar mass 
completeness of several specific samples that have been used 
in past work using BOSS data. We demonstrate that pre¬ 
vious work has sometimes over-estimated the stellar mass 
completeness of BOSS samples and suggest that caution 
needs to be taken when analyzing BOSS samples for these 
types of studies. 

The completeness estimates provided by this paper will 
enable future studies to better utilize the BOSS samples 
for galaxy-formation and cosmology science. A better un¬ 
derstanding of the BOSS selection functions will also enable 
the construction of improved mock catalogs for the BOSS 
survey. Our companion paper presents improved mock cat¬ 
alogs that account for the stellar mass completeness of the 
BOSS CMASS sample as a function of redshift (Saito et al. 
in preparation). 
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APPENDIX A: CATALOGS 

Here we list the various links to the publicly available cata¬ 
logs used in this paper. 

• The original catalog used to target BOSS galaxies. 
The target catalog for Stripe 82 is bosstarget-lrg-main007- 
collate.fits and can be found at http://data.sdss3.org/ 
sas/drl0/boss/target/main007/. The target selection 
flags are contained in the bitfield BOSS_TARGETl. A de¬ 
scription of BOSS-TARGETl can be found at https: //www. 
sdss3.org/drlO/algorithms/bitmask_boss_targetl.php. 
Various targets can be selected via (BOSS-TARGETI 
&&2®) > 0 where i is a binary digit. In this paper, we are 
primarily concerned with the samples GAL_LOZ (i = 0), 
GAL_CMASS which corresponds to the CMASS selection 
described in section 2.1(i = 1), SDSS_KNOWN which 
corresponds to objects with Legacy objects {i = 6), and 
GAL.CMASS^LL which includes GAL.CMASS and the 
entire sparsely sampled region (i = 7). 

• The Portsmouth stellar mass catalogs can be 
found at https://www.sdss3.org/drl0/spectro/galaxy_ 
port smouth.php. 

• The DRIO large-scale structure cat¬ 

alogs (Anderson et al. 2014) that we use 
are galaxy _DR10v8_CMASS_South.fits and 

galaxy _DR10v8_LOWZ_South.fits. These catalogs can 
be found at http://data.sdss3.org/sas/drl0/boss/lss/ 
and the data model can be found at http://data. sdss3. 
org/datamodel/files/B0SS_LSS_REDUX/galaxy_DR10v8_ 
SAMPLE_NS.html. 
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